High-performance GRID Database Manager for Scientific Data

Authors

  • Tore Risch
  • Milena Ivanova
  • Bo Thidé
Abstract

The GRID initiative provides an infrastructure for distributed computation among widely distributed high-performance computers, allowing very large amounts of data to be exchanged and processed. The LOFAR project (www.nfra.nl/lofar) is an international initiative to build a versatile, geographically distributed, multi-point radio facility for astrophysics, space physics, atmospheric physics, and radio research, utilizing very high-performance GRID computing. LOIS is a proposed Swedish outrigger to LOFAR providing a software radar. Because the volume of data processed by LOFAR/LOIS is very large and dynamic, very high-performance data management systems will be needed. To this end, a high-performance, stream-oriented distributed data manager and query processor is being developed that allows very efficient execution of database queries over streamed data involving numerical and other data. Very high performance is attained by utilizing many object-relational main-memory database engines running on PCs and connected through the GRID. The project builds on a high-performance, extensible, object-oriented database engine, the Amos II kernel, developed at the Uppsala Database Laboratory. The stream-oriented DBMS is being developed for representing and querying non-relational data representations extracted from the data flows used in space and environmental physics applications. Of particular interest is the development of new distributed data population and query processing techniques for this kind of application, utilizing distributed and scalable data structures for high-performance stream data processing.

Similar resources

High-Performance GRID Stream Database Manager for Scientific Data

In this work we describe a high-performance stream-oriented distributed database manager and query processor under development that allows efficient execution of database queries to streamed data involving numerical and other data. Very high performance is attained by utilizing many object-relational main-memory database engines running on PCs and connected through the GRID.

An interoperable & optimal data grid solution for heterogeneous and SOA based Grid- GARUDA

Storage plays an important role in meeting the requirements of data-intensive applications in a Grid computing environment. Current scientific applications perform complex computational analysis and consume/produce hundreds of terabytes of data. The authors in this paper have surveyed available data grid solutions, viz., Storage Resource Broker (SRB), Grid File System (GFS), Storage Resource...

Performance Evaluation of MySQL 5.0 and Berkeley DB XML as a Grid Resource Information Manager (GRIM) with a Benchmark/Workload

A challenge in the distributed middleware that implements a grid envisioned to span the world is the management of information about the resources available to the Grid. This paper describes an experimental study we undertook to better understand the performance of the native XML database Berkeley DB XML as a grid resource information manager, compared to MySQL 5.0. We run a benchmark...

The Design and Performance Evaluation of a Lock Manager for a Memory-Resident Database System

In the last fifteen years, lock managers for regular disk-based database systems have seen little change. This is not without reason, since traditional memory-resident lock managers have always been much faster than disk-based database storage managers and disk-based database systems had few alternative design options. However, the introduction of memory-resident database systems has created both...

E2DR: Energy Efficient Data Replication in Data Grid

Data grids are an important branch of grid computing which provide mechanisms for the management of large volumes of distributed data. Energy efficiency has recently emerged as a hot topic in large distributed systems. The development of computing systems is traditionally focused on performance improvements driven by the demand of client's applications in scientific and business domai...

Publication date: 2002